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Abstract. 

The formation of massive stellar clusters is intricately linked to star formation on 
local and global scales. All actively star forming galaxies are forming clusters, and the 
local initial conditions likely determining whether bound massive clusters or unbound 
associations are formed. Here, we focus on observed scaling relations between cluster 
populations and the properties of the host galaxy. In particular, we discuss the relations 
between the fraction of U-band light from clusters vs. their host galaxy as well as the 
brightest cluster vs. population size and host galaxy star formation rate (SFR). We also 
pH ] discuss the the fraction of stellar mass formed within bound clusters within the Galaxy, 

nearby dwarf galaxies, as well as starbursts and mergers. Bound clusters appear to 
represent ~ 10% of star formation within most galaxies, although there are intriguing 
hints that this fraction systematically increases in galaxies with higher star formation 
rate surface densities. Throughout the review we highlight potential avenues for future 
study. 



1. Introduction 

Since the advent of the Hubble Space Telescope (HST) it has been clear that all intensely 
star forming galaxies host ste llar clusters with mas ses comparable to those of the ubiq- 
uitous globular clusters (e.g. jHoltzman et ailll992 ). However, their blue colours imply 



ages less than ~ 1 Gyr suggesting that their formation is directly linked with the intense 
star forming events that their hosts are undergoing. The post-starburst/merger galaxy 
NGC 7252 is often taken as a template, as it hosts an enormous cluster population that 
has an ag e distribution that peaks during the infer red merger between the host spirals 



(e.g.. lMilleret al. 1997; Sch weizer & Seit zer 1998). 



In this review, our aim is to link the properties of the cluster population with that 
of the host galaxy, with a focus on young massive cluster s (YMCs), i.e. cluste r s with 
masses above - 10 4 M o and ages less than ~ 1 Gyr (see iPortegies Zwart et al.l ( 20101) 



for a recent review), although we note that the continuous age and mass distributions 
of clusters makes this distinction somewhat arbitrary. In order to begin we need a 
working definition of a c l uster. For this we will adopt the definition put forward by 
Gieles & Portegies Zwartl (1201 1 ), that a cluster is a stellar grouping whose age is greater 



than a current dynamical crossing time. This naturally includes gravitationally bound 
stellar groupings and excludes unbound expanding associations. One implication of 
this definition is that young objects (commonly referred to as embedded clusters) are 
not included as they are not yet old enough to have had their stars cross from one side 
to the other. This avoids the complication of defining "clusters" at young ages, where 



1 



2 



Nate Bastian 



we see a continuous surface density distribution (iBressert et alJl201Ci) . i.e. where there 



%, i.e 
»1 



is not a clear separation between "clustered" and "distributed" stars [ 

Recent dynam ical studies of res olved young massive clust ers in the Galaxy (e.g., 



Cottaar et ai1l2012l) and in the LMC (He nault-Brunet et all2012h have shown that they 



are in dynamical equilibrium from a very young age, suggesting that once clusters 
emerge from their embedded state they are long lived objects. However, in some cases 
further cluster disruption must be take n into account when studying the properties of 
cluster populations (e.g.. lBastianll201 lh and we will explicitly mention when such cor- 
rections have been applied. 

The goal of the studies discussed in this review, is to compare cluster (population) 
properties with those of the host galaxies. However, for many of the interesting galaxies 
(e.g., high SFR galaxies) which are relatively distant (tens of Mpc), it is difficult to 
distinguish between bound clusters and unbound associations. After 10-20 Myr, it is 
relatively easy to make the distinction between between bou nd and unbound stellar 
groupings dGieles & Portegies Zwartll201lUBastian et al.ll2012h . but since young stellar 
clusters/associations outshine their older cousins (hence will be over-represented in any 
sample), unbound associations may contaminate many of the samples discussed here. 
This caveat will be discussed throughout. 

In the following sections we will discuss three empirical relations that have been 
found between individual YMCs, full populations, and their host galaxies. In §|2]we 
discuss the fraction of U-band light coming from clusters relative to the host galaxy's 
SFR surface density (E sfr)> while in §[3] we focus on the relation between the brightest 
cluster within a population and the galactic SFR. Finally, in § 0] we review attempts 
to derive the fraction o f star formatio n that happens in bound clusters as a function of 
galactic properties (T - lBastianll2008h . 



2. The Fraction of U-band Light in Clusters 



Using the Faint Object Camera onboard HST iMeurer et al. dl995h measured the frac- 
tion of UV light coming from compact objects compared to the full galaxy in a sample 
of starburst galaxies. The UV ttaces the light from young, high mass stars (although 
there is a significant component from > 100 Myr old stars) so this should be a proxy 
for the fraction of stars forming in clusters. Meurer et al. report fractions ranging from 
~ 20 - 50%, along with a trend for th is fraction to increase w i th the star formation 
rate surface density (E sfr)- iZepf et all ( 1999h and lAdamo et al. (2011) found that the 
ongoing galaxy merger NGC 3256 and blue compact galaxies (BCGs), respectively, 
also match the Meurer et al. relation. At face value these results suggest that a higher 
fraction of stars form in clusters in galaxies with high E sfr- 

There are two important caveats to these results. The first is that the studied 
galaxies span a large range of distances from ~ 4 to tens of Mpc. In nearby galax- 



ISchweizeil (2006) summarised this confusion well. He writes: "In my opinion, it is unfortunate that this 
loose, non-astronomical use of the word cluster may reinforce an increasingly popular view that most stars 
form in clusters. By the traditional astronomical definition of star clusters as gravitationally bound aggre- 
gates, most of the objects tallied ... in The Antennae are not clusters, but likely young stellar associations. 
It seems to me in much better accord with a rich body of astronomical evidence gathered during the past 
50 years to state that-although star formation is clearly clustered-even in mergers gravitationally bound 
clusters (open and globular) form relatively rarely and contain < 10% of all newly-formed stars'' 
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ies (with generally low S sfr) individual clusters can be resolved, while for the distant 
starburst galaxies we are only be ab le to resolve cluster complexes with potentially a 
large amount of inter-cluster light. lAdamo et al.1 ( 201 ll) discuss this caveat in more 



detail, although a detailed resolution study has not yet been undertaken. The second 
caveat to this result is that it assumes that the average extinction towards clusters is the 
same as it is to the field, which given the large amounts of wind blown bubbles around 
mas sive clusters, may not be j ustified. 

lLarsen & Richtlerj d2000h extended the Meurer et al. study by looking at a sample 
of nearby dwarf and spiral galaxies. They used ground based multi-band data to select 
young (few hundred Myr or less, but avoiding Ha emitting sources) clusters over the 
full extent of the galaxies. They defined T//U) to be the percentage of light in the U- 
band coming from the clusters in their sample compared to the full galaxy, i.e. Tl - 
100 * ^clusters /^galaxy ; known as the specific luminosity. Their data is shown in Fig.Q] 
as open triangles, and BCGs from Adamo et al. are shown as filled circles. Both 
samples are consistent with the trend observed by iMeurer et al. I dl995h for starburst 
galaxies. Because the galaxies in the Larsen & Ritchler sample are all relatively nearby, 
they should not suffer from the first caveat discussed above, however the second caveat 
remains. 

While the T^(U) is a good proxy for the fraction of star formation happening in 
bound clusters, due to it being a purely observational quantity with limited assumptions, 
other methods have been developed, which are discussed in turn below. No theoretical 
study has investigated the T^(U) vs. X sfr relation in detail. Hence it is not clear to 
what extent T/,(U) traces the fraction of star formation happening in bound clusters 
or how much the star formation history of the galaxy, cluster disruption, or differential 
extinction affect the observed values. 



3. The Brightest Clusters vs. Star Formation Rate 



Many of the propertie s of young clu ster systems are dominated by size-of-sample ef- 
fects dWhitmordbool : lLarsenll2004h . meaning that the larger the cluster population is, 
the more likely it is to sample the extreme ends of the distributions (i.e. forming very 
bright, high mass, or large radii clusters). This can be seen in the relation between the 
luminosity of the brightest cluster (M^ nghtes t ) vs. the number of clusters (above a certain 



magnitude cut) within the population (e.g.. Iwhitmorei 12003 ). 

iLarsenl (120021) found a relatively tight relation between M y " 
formation rate (SFR) of the host galaxy. His data are sho wn as open triangles inFigj2l 



and the star- 



and th e dashed line is the best linear fit through the data (Weidner et al. 2004). B astian 



(2008) expanded the relation b y adding clust ers/galaxies at the extreme ends of the dis- 
tributions (open squares), and Adamo et all ( 20111) placed their BCGs on the diagram 
as well (filled circles). As can been seen there is a clear relation that holds for over six 
orders of magnitude in SFR, with high SFR producing brighter most luminous clusters. 

A nice feature of size-of-s ample effects is that it is straightforward to model them 
using monte carlo techniques. Bastian (2008) carried out such a study by generating 
large numbers of cluster populations. This was done as follows. First, assume that all 
stars form in clusters and that the clusters are sampled from a mass distribution, either 
a pure power-law with index, -2, or with a Schechter function (a power-law at the low 
mass end, followed by an exponential distribution above a certain mass). Then, ran- 
domly assign each cluster an age between and 100 Myr in order to mimic a constant 
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Figure 1 . The relation between the percentage of U-band light coming from clus- 
ters compared the full galaxy as a function of £ sfr- The figure is taken from 
Ada mo et all (1201 ll) and includes the data from lLarsen & Ri chtler (2000). 



cluster formation history. The SFR of each population can be found by dividing the 
total mass formed in clusters by the duration of the experiment (100 Myr). Each cluster 
in the population was then assigned a luminosity and colour, by adopting simple stellar 
population (SSP) models of the appropriate age and mass. The most luminous cluster 
was then chosen in each population, and plotted against the SFR for that model. 

The mean result for a pure power-law mass function (index -2) is shown in Fig. |2] 
as a (green) dotted line. The slope of the line is controlled by the form of the underlying 
cluster mass function, with lower power-law indices (i.e., indices below -2) resulting 
in shallower slopes. However, one generic feature of all the simulations, regardless of 
the form of the underlying mass function, was that they always lay above and to the left 
of the observations. The simplest inteipretation of this is that not all star formation is 
happening in clusters. We note that the scatter observed in Fig.|2]is consistent with that 
expected from stochastic sampling of the cluster mass f unction. 

In order to shift the models onto the observations, iBastianl d2008l) defined T to be 
the fraction of star formation that happens within bound clusters. They assume d that 
due to the rel atively young ages of the clusters used in Fig. |2] (see iLarsenl d2009l) and 
Gieles (2009) 'or a more in depth discussion of the age and brightness of the most lu- 



minous cluster of a population) and the inferred high masses of the clusters, that cluster 
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disruption was not significantly affecting the observed relation. With this assumption, 
they found that T ~ 0.08, with no strong dependence with the host galaxy SFR. 

lAdamo et all d201lh note that their BCG galaxies lie systematically above the 
best fit to the obs ervations, suggesting systematically higher T values. Additionally, 
Cook et all ( 20121) found that within their sample of 37 dwarf galaxies, those galaxies 



that had clusters (7) were consistent with the trend observed in Fig. [2 However, a hand- 
ful of their galaxies were expected to host identifiable clusters based on their observed 
SFR, although none were found, which was surprising even taking stochastic sampling 
of the cluster mass function into account. If true, this would point to an additional 
environmental effect which either allows or stops clusters from forming. 

Like in the previous relation, resolution effects may influence the observed trend. 
Most of the galaxies shown in Fig. [2] were studied with HST imaging, or were nearby 
enough to (semi)resolve clusters with ground based imaging. However, if more distant 
systems are included with ground based (lower resolution) imaging, they will appear 
systematically brighter (since cluster formation is in itself a clustered process, hence 
full complexes of clusters can be mistaken for a single cluster). Hence care must be 
taken before adding new sources to the diagram. 

A possible avenue for future work, beyond adding new galaxies, is to extend the 
relation into different bands (i.e. the near-IR) to limit the effects of extinction, although 
the presence or absence of red super giants within the clusters could complicate the 
analysis. Additionally, including not just the brightest cluster but the median (or total) 
luminosity of the top 5 or 10 brightest clusters could significantly reduce the scatter in 
the observed relation and hence allow one to look at more subtle variations of T with 
host galaxy SFR. 



4. The Fraction of Star Formation In Clusters 

While the above relations between Tl(U) , My lghtest , and the properties of the host 
galaxy provide some insight into the cluster formation process, their are only proxies 
for T (the fraction of star formation that happens within bound clusters). A number of 
studies have attempted to estimate T within different environments, and while a number 
of caveats remain, the results appear to be converging to a consistent picture. 

There are a number of ways to estimate T, each with their own caveats, but the 
basic process is to use the observed portion of a cluster population (old and/or low 
mass clusters are faint and hence will not be observable, even if they are present in large 
numbers) to estimate the cluster formation rate (CFR) and compare that to the current 
star formation rate (SFR) through Ha or IR observations or through the reconstructed 
star formation history made through resolved stellar populations. 

4.1. Galactic studies 

The most accessible region to estimate this fraction is in the solar neighbourhood. These 
studies are based on open cluster catalogues, along with estimates of the total local star- 
formation rate. While we can study near by clusters in extrem e detail, current cluster 
catalogues are only complete to ~ 800 pc ( Piskunov et al.l l2008). and were significantly 



more incomplete in previous decades. 

As noted in the introduction, we will not discuss the fraction of stars forming 
in "embedded clusters", as this dep ends largely on th e definition adopted to separate 



"clusters" from "distributed" stars dBressert et al.ll2010l) . 
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Figure 2. The SFR vs. M^, nshtest r elation for a sample of galaxies. The plot is 
reproduced fr om [ Adamo et alJ (I201 lh . The dashed line is the best fit through the 
iLarsenl (l2002l) data (open triangles) while the (green) dotted line represents the ex- 
pectations from a series of monte carlo models with a ll star formatio n happening in 
clusters and a power-law mass function with index -2 (Bastian 2008). 



Most early studies (e.g.. iRobertsll 1 957 ) focussed on the fraction of O and B stars 
that form within clusters or associations relative to a purely isolated envi ronment. They 
found that the vast majority are formed in groups. Ivon Hoerner (1968) estimated the 
cluster and star formation rate in the solar neighbourhood based on then current cluster 
catalo gues and conc luded that ~ 20% of star formation happened in bound clusters, 
while Wielenl (1197 lb suggested that only a relatively small fraction (of order 2 - 5%) of 
stars form in open clusters. Most subsequent studies, which are fundamentally based 
on the s ame technique of est i mating the CFR and SFR, have converged on a value of 
~ 10% dMiller & Scaldl 19781 : Adams & Myers l200ll ; lLamers & Gielesll200rjh . 

One outlying study was that of iPiskunov et al. I (120081) who found a much higher 
value of T ~ 40%. This was based on a more complete catalogue of clusters. However, 
one essential ingredient in all such studies is the accurate estimation of each cluster's 
mass and age. These authors did not use the usual method of counting the number of 
stars within a cluster (above some magnitude cut) or comparing the cluster absolute 
magnitude to SSP models (of the appropriate age) to estimate each clusters mass. In- 
stead, they fit King profiles to confirmed cluster members and estimated the mass based 
on the truncation radius of the best fit profile. This method is extremely sensitive to any 
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error in the truncation radius (to the third power) and it also assumes that young clus- 
ters are tidally truncated. Comparing their derived masses for a number of well studied 
clusters (e.g., the ONC) with in depth photometric studies suggests systematic devia- 
tions, which could explain the high value of T. Using the Piskunov et al. sample, but 
with masses esti mated on the number of s tars (and age) above a certain magnitude limit 
for each cluster, Lamers & Gieles (2006) found that ~ 7% of stars formed in bound 
clusters, in good agreement with the previous studies. 



4.2. Extragalactic determinations 

Outside the Galaxy, relatively few direct estimates of the fraction of stars born in clus- 
ters have been made. Using size- of-sample effects (the most massive cluster in different 
age bins), iGieles & Bastianl (120081) estimated that optically selected clusters (older than 
~ 7 Myr) in the SMC represented 2 - 4% of the total star-formation in the galaxy. 
By comparing th e location of clusters/associations with the location of Ho- emission, 



Fall et al.l d2005h report that the majority (> 80%) of star formati on within the An 



tennae galaxies happens within clusters or associations. However, iSchweizeil (2006) 
suggests that even within galaxy major mergers, < 10% of star formation bound star 
clusters. 



Goddard et al. (120101) have estimated Y for a sample of galaxies (SMC, LMC, 



NGC 1569, M83, and NGC 3256) by extrapolating the observed part of the cluster dis- 
tribution (i.e., the age and mass of the young and massive clusters) to estimate the total 
population (they also provide a detailed analysis of the associated errors in converting 
the observed to the total population), and find an average of ~ 10%. However, they 
report a trend of increasing T as a function of increasing £ sfr- Their data are shown 
as open diamonds in Fig. [3j and the reported best fit relation is shown as a solid line 
(r(%) = 29 x £°p^, where X sfr is in solar masses per year per kpc -2 ). Some of the 
values were calculated only using clusters with ages less than 10 Myr, whereas others 
were calculated using clusters with ages between 10 and 100 Myr. Hence, it is possi- 
ble that if cluster disruption operates strongly over ~ 10 Myr time scales, this could 
influ ence the results. 

ISilva- Villa & Larsenl (1201 lh studied nine regions in a sample of four nearby star- 



forming spiral galaxies using multi-wavelength HST imaging. The found values of T 
between 2% and 10%, in good agreement with previous studies. Depending on how 
they corrected for cluster disruption, their values for individual galaxies varied by a 
factor of 2. These authors did not find any strong correlation with X sfr (their data are 
shown open squares, with the two colours representing different disruption corrections), 
although their galaxies only covered a relatively narrow range of X sfr- However, the 
authors did find a correlation with T^(U) , suggesting that T^(U) may act as a good 
proxy for T, although an exact mapping between the two has yet to be reported. In- 
creasing the sample of galaxies that have had T estimated and T^(U) measured could 
provide a large step forward in our understan ding of cluster for mation and destruction. 



Also shown in Fig.|3]are the BCGs from Ada mo et al.l(l201 lh (filled circles). These 



studies are based largely on young clust ers, so that cluster d isruption should not be a 
strong factor. However, as discussed in lAdamo et al.1 ( 201 lh . many of these galaxies 



are quite distant, meaning that individual clusters cannot be resolved, hence the sources 
within their catalogues may be cluster complexes. This would lead to systematically 
higher T values. 
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Cook et all (120121) estimated T for a large number of low SFR dwarf galaxies as 
part of the ANGST HST survey. They found th at summing thei r entire sample together 
led to a T value of 2-8%, consistent with the iGoddard et al.l d201dh relation. How- 
ever, a handful of their sample were inconsistent with expectations, even accounting 
for stochastic sampling of the cluster mass function, potentially suggesting that a sec- 
ond parameter was relevant in whether a cluster was formed or not. 

As noted by the authors, the above studies have a number of caveats associated 
with them. There is scope to improve upon this first generation of extragalactic studies, 
stalling with understanding resolution affects and identifying clusters, to the derivations 
of cluster properties and the assumption of a constant star formation rate of the host 
galaxy over -100 Myr timescales. One such study that is likely to overcome many of 
these limitations is the The Panchromatic Hubble Andromeda Treasury (PHAT) survey 
(iDalcanton et al.ll2012h . This survey will cover ~ 1/3 of M31 with multiple filters and 
allow the identifica tion of thousands of stellar clusters along with providing estimates of 
their ages/masses (Joh nson et al.ll2012h . The survey will also provide spatially resolved 
star formation history based on individual resolved stars. Combining the cluster catalog 
with the star formation history will allow the determination of T for different regions 
of the galaxy to test for any dependence on SFR, £ sfr ; galactocentric distance, ISM 
propertes, etc. 
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Figure 3. The relation between E sfr and the fraction of star form ation that hap- 
pens i n bound clusters (F). The solid line i s the best fit through the IGoddard et al.l 
d2010h relation. The plot is reproduced from lAdamo et alj d201 ll) . 
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There has been recent theoretical advances in our understanding of T and its po- 
tential variation with environment . This is beyond the scope of the current review, but 
we point the interested reader to iKruijssenl (120121) who presents a theoretical frame- 
work in order to understand these results, linking the density distribution of the ISM to 
star/cluster formation. 



5. Concluding Remarks 

While a significant amount of work has been invested into finding and understanding 
the above relations, the field is still at a relatively youthful stage. Many caveats remain 
in the observations/interpretations and theory is just beginning to catch up. 

It is interesting to note that some of the observed properties (i.e., T/,(U) and T) 
correlate with S sfr while others (My lghtest or the total number of clusters with a popu- 
lation) correlate with SFR. The reason for this could be that M^ n § htest ( or total number) 
is dominated by size-of-sample effects which is more related to the host SFR, i.e., form- 
ing more stars results in more clusters which sample the distributions more fully. For 
example, doubling the SFR does not change the fraction of stars that end up in clusters, 
it simply doubles the number of clusters present. Properties such as T/,(U) or T reflect 
relative quantities, i.e., what part of the star formation hierarchy ends up in bound clus- 
ters, and hence may depend on the physical properties of the host (or more likely the 
physical properties of the ISM within the host galaxy). Changing E sfr may influence 
the ISM, which in turn may affect the distribution of star formation, altering the fraction 
of star formation occurring in bound clusters. 

Finally, we note that the relations discussed here apply only to young clusters 
within galaxies. While there are many links between YMCs and globular clusters (c.f., 
Portegies Zwart et al.1 12010) there are other hints that they may have formed in very 



different conditions. One recent example is a study of the metal poor globular clusters 
in the Fornax dwarf galaxy. These clusters represent ~ 25% of all the metal poor stars 
within the galaxy, suggesting an extremely high T value, a cluster mass func tion devoid 
of low mass clusters, and negligible cluster disruption (lLarsen et al.ll2012 ). Hence, it 



remains to be seen if and how YMCs can be used as templates for globular cluster 
formation. 
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